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Abstract 

We consider complex sample covariance matrices Mn = j?YY* where 
Y is a N x p random matrix with i.i.d. entries Yij, 1 < i < N, 1 < j < p 
with distribution F. Under some regularity and decay assumption on F, 
we prove universality of some local eigenvalue statistics in the bulk of the 
spectrum in the limit where N — > oo and linijv->oo p/N = 7 for any real 
number 7 £ (0, 00). 

1 Introduction 

1.1 Model and result 

This paper is concerned with universal properties of large complex sample co- 
variance matrices in the bulk of the spectrum. We consider N x p random 
matrices Y = {Yij) where the Yij's are i.i.d. random variables with some prob- 
ability distribution F. Let then Mjv be the sample covariance matrix: 

M N = —YY*. 
N 

In the whole paper we assume that p> N and that 

37 6 [1, 00) such that p/N — > 7 as N — » 00. 



The case where p < N can be deduced from the above setting using the fact 
that YY* and Y*Y have the same non zero eigenvalues. In the sequel we call 
Ai > A 2 > • • • > An the ordered eigenvalues of and ttn := -k J2iLi ^ 
its spectral measure. Assuming that F has a finite variance a 2 , Marcenko and 
Pastur ( [19], see also [21]) have shown that irpj almost surely converges as 
N — » 00 to the so-called Marcenko-Pastur distribution p £2. This probability 
distribution depends on a 2 only and not on any other detail (higher moments 
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e.g.) of F: in this sense it is universal. It is denned by the density with respect 
to the Lebesgue measure: 



u + — x)(x — U-) 




(1.1) 



dx 



2itxo~ 2 



>-,«+]' 



where u + = a 2 (l + ^j) 2 and w_ = <r 2 (l - yfy) 2 . 

It has been conjectured (see [3U] for instance) that, in the large- iV-limit, some 
finer properties of the spectrum are also universal. For instance, the spacing 
between nearest neighbor eigenvalues in the vicinity of a point u £ (ii-,u + ) is 
expected to be universal, under the sole assumption that the variance of F is 
finite. The spacing is actually believed to be "more universal" than the limiting 
Marcenko-Pastur distribution, in the sense that it is expected to be the same 
as for Hcrmitian Wigner matrices. To investigate such local properties of the 
spectrum, we introduce the so-called local eigenvalue statistics in the bulk of the 
spectrum. Given a symmetric function / £ L°°(R m ) (m fixed) with compact 
support, a point u £ and a scaling factor ppj, we define the local 

eigenvalue statistic S^\f,u,p N ) by 



where the sum is over all distinct indices from {1, . . . , N}. When u is in the bulk 
of the spectrum, that is u £ (w_,u + ), the natural choice for the scaling factor 
is pn = Np^ 2 (u), which should give the scale of the spacing between nearest 
neighbor eigenvalues in the vicinity of u. 

We here prove universality of some local linear statistics in the bulk of the 
spectrum for a wide class of complex sample covariance matrices. We follow the 
approach used by [15] (and a series of papers [13], [12], [14]) where universality 
in the bulk of Wigner matrices is proved. We now define the class of matrices 
under consideration in this paper. Let p, be the real Gaussian distribution with 
mean and variance 1/2. Let F be a complex probability distribution whose 
real and imaginary parts are of the form 



SN{f,u,p N )= ^2 f(pN(\i-u),...,p N (\i 



«))> 



(1.2) 



l\ ,. . . ,2 



v(dx) 



e- v ^p{dx) 



(1.3) 



for some real function V satisfying the following assumptions: 
- V £ C 6 and there exists an integer k > 1 such that 
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(1.4) 



there exist S%, C > such that VieR, 



(x) < C'e- 5l|x|2 . 



(1.5) 
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This assumption can actually be relaxed as explained in Remark 11.21 below to 
consider distributions v with sub-exponential decay only. 
- F is normalized so that 

J xdv{x) = and J \x 2 \dv{x) = 1/2. (1.6) 

In the sequel we consider sample covariance matrices Mm — jjYY* where 
Y = (Yij) is a N x p random matrix such that: 

Yij, 1 < i < N; 1 < j < p are i.i.d. random variables with distribution F. 

(1.7) 

One shall remark that the condition (|1.6p can always be achieved by rescaling 
and does not impact on the generality of our next results. 
We now give our two main results. Let e > small be given. 

Theorem 1.1. Assume that F satisfies \1.3\) to \1.6\) . Let also u e (it_ + 
e, it+ — e) be a point in the bulk of the spectrum and pn — N P^i ( u )- Then 



lim ES$(f,u,p N ) = ( f(x,y)(l- (^-^ — ^) )dxdy. 



Remark 1.1. It is also possible to prove universality of local eigenvalue statistics 
for m = 3 using the approach developed hereafter. Nevertheless to consider 
higher integers m, one needs to increase the regularity of V (see Remark 1.1 
in [15]). 

We can also prove that the spacing distribution close to a point u in the 
interior of the support of Marcenko-Pastur's law is universal. Let s > and 
(tjv) be a sequence such that liuiN^ootN = +oo and limN^oo J-fLp — for 
some j3 > 0. Let u e (u_ + e, u + — e) be given and pn = Np^f (u). Define then 
the "spacing function" of the eigenvalues by 

S N (s,u) := -j-|t (l < j < N - 1, X j+1 - Xj < —, \Xj — u\<—\. (1.8) 
ztN I Pn Pn ) 

Intuitively the expectation of the spacing function is the probability, knowing 
that there exists an eigenvalue in the interval [u — i/v, u + 1^] , to find its nearest 
neighbor within a distance — . Finally, we define 



P( s ) = ^2 det ( 7 -^)i 2 (o,a), where K(x,y) := ^ - y) (1 ' 9) 



Theorem 1.2. Assume that F satisfies U.3\) to U.6\) . Let also u € [u- + 
e, w+ — e) be a point in the bulk of the spectrum. Then, 

lim ES N (s,u)= [ p(w)dw. (1.10) 

JV^OO Jr. 
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Remark 1.2. Using truncation and centralization techniques, it is possible to 
prove both Theorem 11.11 and Theorem 11.21 when Assumption 11.51 is replaced by 
the weaker assumption 

3Ci,C 2 > such that v{x) < de' ^ 1 . 

This extension is examined in full detail in Section 5 of [15] for Wigner random 
matrices and readily extends to sample covariance matrices. 

The first proofs of universality in the bulk of the spectrum of large random 
matrices have been obtained for the so-called invariant ensembles ( [TJ, [8], [5]). 
Their proof relics on the fact that the joint eigenvalue density of such ensem- 
bles can be computed and the asymptotic local eigenvalue statistics can then 
be determined. A breakthrough in the proof of the conjecture was obtained 
in [TB] (following the idea of [5] and [B]), proving universality in the bulk for the 
so-called Dyson Brownian motion model ( QT]). This then allowed to extend 
universality results to a wide class of non invariant Hermitian random matrix 
ensembles- the so-called Gauss divisible ensembles (see Subsection 11.21 for the 
definition). [4] have then obtained the same universality result for Gauss divisi- 
ble complex sample covariance matrices. Very recently, such universality results 
have been greatly improved by |23j and |15) for Hermitian Wigner random ma- 
trices. Both the papers remove the Gauss divisible assumption and only assume 
sufficient decay of the entries of Wigner random matrices. The approach of [33J 
is to make a Taylor expansion of local eigenvalue statistics of Wigner matrices. 
The core of the proof is then to show that these statistics depend, in the large- 
N- limit, only on the first four moments of the entries of the Wigner matrix. 
Proving that any Wigner matrix can be matched to a Gauss divisible matrix 
with the same first 4 moments allows to prove a very general universality result. 
On the other hand, the approach of [15] is to show that any Wigner matrix 
(under suitable decay of the entries) can be sufficiently well approximated by a 
Gauss divisible random matrix, so that the bulk universality follows. We also 
refer the reader to [16] where the two approaches are combined. 
While writing this paper, a proof of universality in the bulk of the spectrum 
of large sample covariance matrices has been obtained in ^3] in the sole case 
where p — N = O(N^s), but with much milder assumptions on the decay of the 
distribution v. Therein v satifies / \x\ c °dv(x) < oo for some sufficiently large 
C in place of our assumptions (|1.4I) and (11.51) . Their approach is based on the 
ideas developed in [23 . 

Our paper closely follows the ideas developed in [15] . We give an overview of 
the proof in the next subsection. 

1.2 The idea of the proof 

Following the pioneering work of [TB] and [5] , [3] have shown universality of local 
eigenvalue statistics in the bulk of the spectrum for complex sample covariance 
matrices when the distribution of the sample is Gauss divisible. We recall that 
a complex distribution (1q is Gauss divisible if there exist a complex probability 
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distribution P and a non trivial complex Gaussian distribution G such that 
Hq = P * G. Equivalently [4] consider random matrices of the form 

M N = — (W + aX) (W + aX)* , (1.11) 

where W and X are independent N x p complex random matrices both with 
i.i.d. entries. The Wy 's are P distributed and the X y 's are complex standard 
Gaussian random variables. In the above context a is real number independent 
of N. The proof of [T5] and [2] relies on three main steps: 

-conditionnally on H = j?WW*, the eigenvalue process induced by Mjv defines 
a so-called determinantal random point field; 

-the corresponding correlation kernel can be expressed as a double integral in 
the complex plane depending on H through its sole spectral measure [In] 
-under suitable assumptions on W and thanks to concentration results for the 
spectral measure fiN established by [2], [3] and [17], the asymptotic analysis of 
the correlation kernel (and local statistics) can be performed. 
The parameter a is to be seen as the "order of the Gaussian regularization" of 
P. In principle this result would yield a full proof of the universality conjecture 
if one could let a approach (and be smaller than) 1/y/N. Unfortunately this 
idea fails whatever sharp concentration results can be established for /Ujy The 
asymptotic analysis is not stable in this scale. 

A breakthrough to overcome this difficulty is obtained in [15]. In [13], [12] . 
[Ti] concentration results are deeply sharpened so as to be able to consider a 
Gaussian regularization of order a >> -^=. Given an arbitrary (non Gauss 
divisible) distribution F, the main point is however to show that one can find 
a Gauss divisible distribution approximating F sufficiently well so that one can 
deduce universality in the bulk for a sample covariance matrix with i.i.d. F 
distributed entries. This is the main step achieved in |15j . which we now briefly 
expose. 

Consider the Ornstein-Uhlenbeck (OU) process 

1 d 2 x d 

Given a complex probability distribution Jl, let then X t be the N x p matrix 
process with initial distribution Jx® N P and whose entries (real and imaginary 
parts) evolve independently according to the OU process. Then X t evolves as 

*^e-*/ 2 (H + {e l -l) l / 2 X 

where H has i.i.d. entries with distribution Jl and X has standard complex 
Gaussian entries. Let e tc := (e tL )® N 'P denote the dynamics of the OU process 
for all the matrix elements. Here, for small t, one should think that t ~ a 2 . 
[15] prove that there exists a Gauss divisible distribution tf G approximating F 
in the total variation norm in a sufficiently good way. Roughly speaking, let 

f/ G = e tL (l -tL + tL 2 /2)F c , Jl=(l-tL + t 2 L 2 /2)F c 
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where F c is obtained from F after truncation and centralization. Intuitively, 
it is reasonable to expect that fj,Q is a good approximation of F in the scale 
t « 1. For sake of completeness we here recall their result (Proposition 2.1). 

Let 9 be a smooth cutoff function satisfying 6(x) = 1 if |je| < 1 and 0(x) = 
for |x| > 2. 

Proposition 1.1. Let V satisfy {l-4\) , U.5)) for some k and il.6}) . Let A > be 

sufficiently small and t = iV A_1 . Let cn, d^ be real numbers so that vdfi defines 
a centered probability density if the function v is given by: 

v(x) := e- vc (' x \ V c {x) := V(x)6 ((x - c N )N- x ^ k ^j + d N . 

Let then C := L® Np , f N:P := (e"^)®^, and f c , N , p := v® n p. 

1. There exist constants C > 0, c > depending on k and X such that 
J\fc t N, P -fN, P W® Np <Ce- c ^) c/ \ 

2. gt '■= (1 — tL + t 2 L 2 /2)v is a probability measure with respect to d^x. Setting 
Gt '■— [9t]® Np , there exists a constant C depending on A and the constants 
Cst and Si defined in \1.J$ and M.5\) such that 

[ \ etCG t - fc,N,p\ 2 d ®N P < CN t 6-X < CN -4+8X^^ 

J e tc G t N 

The idea is then to prove Theorem 11.11 and Theorem 11.21 for the Gauss 
divisible ensembles with small parameter a ~ iV^ A_1 ^ 2 for some A > 0. Then, 
using Proposition ll.il and following the idea of [TS] Section 4, one can extend 
universality of local eigenvalue statistics Sj^ 1 (/, u, pn) with m = 2, 3 and that 
of the spacing function to sample covariance matrices satisfying (|1.7p . 

The paper is organized as follows. In Section [2] we study eigenvalue statitics 
in the bulk of the spectrum for Gauss divisible sample covariance matrices. To 
this aim, we first recall some properties of the Deformed Wishart ensemble. 
This is the conditional distribution of Mn knowing W . Such an ensemble is in 
particular known to be determinantal, as we recall. We then establish some im- 
proved convergence rates for the spectral measure of sample covariance matrices 
whose entries have a sub-Gaussian tail. These concentration results then allow 
to compute the asymptotic correlation functions as N —¥ oo in the regime where 
a — > 0, a >> ^t=. We then prove Theorem II .21 and Theorem 1 1.1 1 in Section [3] 
In the whole paper, we use C and c to denote constants whose value may vary 
from line to line. 

2 The Gauss divisible ensemble 

In this section we establish some universality results for the following Gauss 
divisible ensemble: let 

M N = — (W + aX)(W + aX)*, (2.1) 
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be a Hcrmitian N x N random matrix where 



• (Ho) X and W are independent N xp random matrices where p > N and 
there exists 7 > 1 so that limN^oo p/N = 7. 




where A > is a (small) real number; 



• (H2) X is a N xp random matrix with complex standard Gaussian entries: 

RcY, ; . lm.V,, - 7V(0, 1/2), V 1 <i<N,l<j<p; 

• (H3) W — (Wij) , 1 < i < N, 1 < j < p, is a complex random matrix such 
that Re Wij, ImWij, 1 < i < N,l < j < p are i.i.d. and satisfy : 

(Ai) There exists a constant S a > such that Ee 5 "' 1 ** 3 '' < 00, V«,j. 

Without loss of generality, we also make the assumption that 

E|W 4J | 2 = l/4. (2.2) 

Note that (|2.2[) can always be achieved by rescaling Mm- From now on, we 
denote by y\ > yi > ■ ■ ■ > un the ordered eigenvalues of H — Hn = WW*/N 
and let //at = jj^2^LiS Vi be its spectral measure. Under assumption (|2.2[) . it 
is known that /ijv almost surely converges to the Marcenko-Pastur distribution 
with parameters 7 and 1/4, p^y 4 — P* /P , whose density function is given by 
(fTTTf) with a 2 = 1/4. When 7 = 1, we simply denote pf fp by p MP for short. 

2.1 Correlation functions 

The sample covariance Gauss divisible ensemble has a nice mathematical struc- 
ture_that we are going to make use of in the sequel: the conditional distribution 
of Mm with respect to W is the so-called complex deformed Wishart ensemble. 
Such an ensemble has been widely studied in random matrix theory as it induces 
a determinantal random point field. We recall some results used in [J] (see ref- 
erences therein) that will be needed for the sequel. Let then (Ai, A2, . . . , \n) 
be the joint eigenvalue distribution induced by the conditional distribution of 
w.r.t. H. Then (see Section 3 in [4]), P$ is absolutely continuous with 
respect to Lebesgue measure on R^. Its density f§ is given by: 

^-m^-^m (t)x- 

(2.3) 

where V(x) := rii<i<j<Ar( a:: i — x j)> v = P — N, l v is the modified Bessel func- 
tion of the first kind, and S — a 2 /N. The main tools to study local eigen- 
value statistics are the so-called eigenvalue correlation functions (see Section 
[3]) . They are defined for any integer 1 < m < N by (x\ , X2, • ■ • , x m ; H ) = 
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(JV-m)! ftt^-™ f" (^1^2, • ■ • ,X m , A m+ i, ■ ■ ■ , Ajy) H^Lm+l dX i- TnCn > for ^ [n ~ 

teger 1 < m < N, one has that 

R ( x ) (x 1 ,x 2 , ■ ■ ■ , x m ; H) = det {K N {x u ar^-; #))™- =1 , 

for some correlation kernel Kjy(u,v; H). This gives the determinantal random 
point held structure. Furthermore Kn(u,v; H) is given by 



w — z w + z 



where i4T„ is the modihed Bcsscl function of the second kind. T is a contour ori- 

1 /2 

ented counterclockwise enclosing the ±y i ' ,i = 1, . . . ,N and A is large enough 
so that the two contours T and T = iR + A do not cross each other. 



-Vi 



1/2 



1/2 

y 2 



1/2 



1/2 

■Vn 



1/2 



2/1 



1/2 



Figure 1: The two contours dehning the correlation kernel Kjy(u,v;H). 

Due to the determinantal structure, correlation functions are not impacted 
by conjugation of the correlation kernel. In particular, for any b € K and any 
integer m > 1, 



det (K N (ui,Uj;H))" L = det I K N (ui,uy H)e s 



In the sequel we consider the conjugation for some b that will be defined in the 
asymptotic analysis. We set 



K b N ( Ui , Uj ; H) := K N (u u uy H)e 2b ^~^l s . 



(2.5) 
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The correlation kernel K b N {u, v; H) only depends on H through its spectral 
measure fi n ■ As in the case where a is fixed (independent of N) , the idea is to use 
the convergence of /xjv to the Marcenko-Pastur distribution. In particular, one 
would like to make the replacement (outside a suitably negligible set) Y\f =1 {w 2 — 
j/i) = exp{iV(l + o(l)) J\n(w 2 — y)dp^ rp (y)}. In the next subsection, we prove 
that this replacement can be made in some sense provided Im(u; 2 ) is not too 
small. 



2.2 Concentration results 

This subsection is devoted to the proof of the following Proposition 12. 1[ which 
precises the rate of convergence of /in to p MP . Before exposing this Proposition 
we need a few definitions and notations. Given a complex number z = u + ir), 
u € R, 77 > 0, we define the Stieltjes transform of /ijv by 

m N (z) := J - j- - dnN(X), (2.6) 

and the Stieltjes transform of the limiting Marcenko-Pastur distribution by 



m MP {z) := / ^—dp" P {\). (2.7) 



Let e > be given (small enough). 



Proposition 2.1. Let z = u + irj for some u G [u— + e,M+ — e] and r\ > 0. 

Then, there exist a constant c\ and C ,C > 0, c > depending on e only such 
that V<5 < c±e, 



sup 

uE[ii_+e,u+- e] 



m N {z)-m M p{z)\ > (£ + Co|-^-7|)J < Ce^ 5 ^, 



for any (InN^/N < rj < 1. Furthermore, given rj > (ln7V) 4 /7V, there exist 
constants c > 0, C > and such that \/k > K Q , 

pf sup |mAr(a; + iy)| > k ] < Ce' c " /7n ^. 

\ |x|>(e/200) 2 ,y>J) / 



Proof of Proposition I2.lt To ease the reading, the proof is postponed to 
Appendix A. The proof is actually a modification of that of Theorems 3.1 and 4.1 
in [T3], where the Stieltjes transform of Hermitian Wigner matrices is considered. 
We thus simply indicate the main changes. 



2.3 Asymptotics of the correlation kernel when v = p — N 
is bounded 

The aim of this subsection is here to prove the following Proposition. 
Let e > be given (small) independent of N. 
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Proposition 2.2. Let u* € [v,- + e,u + — e] = [e, 1 — e] and m > 1 be given. 
Consider a sequence u = ujy such that N l ~^\u — u*\ = o(l). Then, there exist 
i)€ R, a set f2jv 7 o,nd positive constants C, c such that: 
-the complement offl^ is negligible: P(tt c N ) < Ce~ cN 1 
-on fijv one /ias i/iai 



Remark 2.1. In Remark 12.31 we explain how to extend Proposition 12.21 to the 
case where \u — u„\ = o(N~ c ) with c > arbitrarily small. This extension is 
needed for the proof of Theorem 11.21 

This subsection is devoted to the proof of Proposition 12.21 The proof is 
divided into two parts: first we obtain a new expression for the correlation 
kernel, which then allows to derive its asymptotics by a saddle point argument. 

2.3.1 Rewriting the kernel 

Our strategy consists into two parts: we first replace the Bessel functions with 
their asymptotic expansion and then remove the singularity \/{w — z) in the 
correlation kernel as it will be proved to prevent a direct saddle point analysis. 
As this part is highly technical and needs a few notations, we summarize our 
main result in Lemma |2 . 31 stated at the end of this subsection. The reader might 
skip this part and is simply referred to the above cited Lemma. 

Our first task to replace in (|2.4|) Bessel functions with their asymptotic 
expansion given in Appendix [B] formula (|B.1[) . This replacement can be made, 
up to a negligible error, provided the contour T is cut in a small neighborhood 
of the imaginary axis: see Figure [2j In the sequel (see Figure [2]) we call 
(resp. xf) the endpoints of T c with Re(x G ) < (resp. Re(xi) > 0) with 
positive/negative imaginary part surrounding the imaginary axis: x^ and xf 1 
will be chosen in subsection 12.3.21 We also call Ti (resp. I^) the part of T c 
lying to the right (resp. left) of iM.. Then Ti is the image of T2 by z i~> — z and 
r c = Ti U T2- One obtains the following Lemma. 

Lemma 2.1. For any b £ R, one has that 





(2.8) 



where 
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1/2 

-Vi 



-Vi 



1/2 



1/2 



1/2 



1/2 

Vl 



1/2 

fa 



T 



Figure 2: The contour T c has been cut along the imaginary axis. 



2b/ rrN e s / 7 / 7 ~ z +" 2 ™y^ _l£v5 

4i 2 7T 2 S 



Krf (u, v; H) = — - , 9 _ 9 - j dw I dze s e s e 



iR+A Jr 



N n 

1 jt w - Vi (wyw + z in+iv/2 

VW«M 1/4 1\ z 2 ~Vi ' 

Proof of Lemma I2.lt Here we will call on some arguments already used 
in [3] , which we won't develop entirely, yet trying to be the most self-contained 
as possible. Lemma [2.11 essentially follows from the two claims exposed in the 
sequel and whose proof relies on the saddle point analysis in the next subsection. 

Claim 2.1. One has that 



r^co 2i 2 "n*S z J iR+ A Jr\r c \ S / V S J 

11 z — m \ z J \ w — z w + z ) 



N 
N 

yi \ z ) \w — z w 

Claim [2~T1 essentially follows from the fact that < e z if Re(z) > 

(and a similar bound for K^) and the fact that T \ T c lies far away from the 
critical points (a full justification can be derived from the saddle point analysis 
performed in Subsection 12.3. 2[ see Remark [ 

Claim 2.2. One has that 
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N o 

w 1 — yi ( w\ v I w + z w — z 



Z 2 — Vi V z / 



^ z'-i/j \ z/ \w — z w + z 



dw I d 



4i 2 7r 2 S' 7 lR+A J Tl " (uv)i 



wdz 



n ^ (-)" (— - ^ ) (i + o(iv-v 2) ) . (2.1D 



By a straightforward change of variables, we can reduce F c to Y\. Claim 
then follows by uniform asymptotic expansions of first the w— integral, using the 
asymptotics of the Bessel function K u given in (|B.1|) . and then of the z— integral. 
Again this follows from the saddle point analysis of Subsection l2.3.2l This saddle 
point analysis will prove that the correlation kernel does not vanish in the large 
N- limit. Then splitting T\ into sufficiently many pieces lying to the left or to 
the right of the critical points and moving accordingly T = iM. + A to pass close 
enough to the same critical points yields that the error term is of order N~^ 2 . 
This yields Claim 12.21 and Lemma 12.11 We skip the detail. □ 

A saddle point analysis of the correlation kernel K 1 ^ cannot be performed at 
that point, due to the singularity l/(w — z). Indeed, assuming that IliLi^ 2 — 

yi) = e J N\n(w 2 -y)dp MP (y)(l+o(l)) ^ j t jg nQt difficult to see that the z _ and 

uu— integrands have the same critical points. Thus we first remove this sin- 
gularity (see [1] for a more detailed explanation). 
Define 

N 

f»{z) :=z 2 -2V^z + sY / Hz 2 -y l ), f N (z) = f^(z) + 2bV^ 

i=l 
N 

f»{z) := z 2 - 2V^z + Sj2Mz 2 -Vi), fN(z) = f?(z) + 2brf 

, , ( ^ (//)>) , 

g N ,u{ w ' z ) ■= ( w - °) 1" Uu ) (z), 

w — z 

2( m -6)(yTT- s /S) 

1 , , vbS Sb w — z e s —1 

9n,u\w,z) = + — , 6{w,b) = -— tt7-7= 7=C) 

wz Zwz w + z 2(w — b){y/u — tJv) 

2(z-b)( v ^+ v /^) 

^(z, b) = % — i-= ==r. (2.12) 

^ ^ 2{z-b){^+ y/v) V ' 

Lemma 2.2. One has that 



N 

1 f , f , ImM^ImM 

dw / dze s 
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w + z 

X ■ 



(( . z (g^ u (w,z)+g 1 N Jw,z)) 9(w,b) (2.13) 



(x 1 - b) f w + x 1 ( w \" /jvW-Jjv^i ) 

' du> p= — e s 6(w, b) 



4i 2 7r 2 (u«) 1 /4 y iR+yl ^_ rr ^ 



1 ' p= — r e s («;,&). 



(2.14) 



Setting K 2 /(u,v;H) := e-^~^/ 2 K^ b (u, v; H), we have that 
K^ b (u,v; H) = 

e s f f iiW^sii) 

,. 2 2g / aw dz{g N , v (w,z) + g Nv (w,z)) e 3 

44 7T a JiR+A JTi 

w + z (wy lf 

Uiy^uu) 1 / 4 \ Z ) 
e - b) f W + X 1 ( W\ ) ,, _ 

=!== I — I e s 6 L (x 1 ,b) 



4i 2 7T 2 (wi>) 1 /4 



e (a;+ —b)f w + xf fN^y^N&j} ( w 



rfw - ' 7i_ e "" ' s ( -^r ) X (a;+,6). 

(2.15) 



Proof of Lemma 12.21 We only consider K]f (the arguments are similar 
for K z N b ). To this aim we make the change of variables w = b + (3(w' — b), 
z = b + /3(z' — b) for some j3 real close to 1 and get the following : set 



N , N 

-2yT7m -p- 1- „ ^-iygi 



E(w) := e ^r^ JJ( W 2 _ yi ) ; G ( z ) = e ^-^ [](z 2 - w ). (2.16) 
Then one has that 

l i/(ui) {wy W + Z 



Kh b (u,v:H) =— — i / dw I dz ■ — — W . (—') — 

W {uv^Ai^S Jm+A Jr, ^ G{z) \ z ) w-z 



■idirdr. 



<HWS(uv)i JiR+A M y/(b + P(u/-b))(b + P(z'-b)) 
(b + p{w' - b) Y 2b + g(y + - 26) E(b + - 6)) 



\b + P{z'-b)J w'-z' G(b + P(z' - 6)) 

' dwdz (wyw + z E{w) 



( 



Ai 2 TT 2 S(uv)i JiR+A J x+ y/wz\zJ w - z G(z) 

r b+fi(x--b) dwdz w + zE(w) 



2b(v^7-y^r) 

e s 



Ai 2 TT 2 S(uv) 4 7iK+A ii- v 7 ^ ^ z ^ w-z G(z) 



(!) 



13 



(2.17) 

Differentiating with respect to (3 (close to 1), we find that (see [TH] Section 2 for 
the detail) 



e s f f 1 /tc\ 1/ M) + z£(i()) 



- 2(iu + 2 - 6) + 20* + £ > — — r (2.18) 

^65 Sb w — z 
wz 2wz w + z/ 

[x± — b)e s / 1 ^w^w + x^ £/(w) 



4i 2 7r 2 S' 7 iR+A I + \x+ J w-x+G{x+) 



( , 7 _ &)e ^P^ /■ ^ 1 = ^.(2.19) 



4i 2 7r 2 5 j iR+A <f^x7\ x iJ w-x 1 G(x 1 ) 



j i 

By the definition of f^, the term in the bracket (j2 . 18[) can also be written 

( w -b)(f?y( w )-(z-b)(f?y(z) 



w — z 

- (w _ b ) (fu )'(w) ~ (fu)'(z) _ U Ny {z) _ (2 2Q) 



$lM= — 



Integrating by parts yields Lemma 12721 □ 

To summarize this highly technical subsection, we have proved that 
Lemma 2.3. Assume that v is a bounded integer. Then 

K b N (u, v: H) = (1 + o(l)) (K]f{u, v; H) + K 2 /(u, v; H)) , 
where the kernels Kjj and K^ b are defined in Lemma \2.2i 



2.3.2 Saddle point analysis of the correlation kernel. 

We are now in position to perform the saddle point analysis of the correlation 
kernel K^(u,v; H) and prove Proposition 12.21 Let r be a given real number 
independent of N and assume that v — u = r/ (Np MP (u)). We mainly focus on 
the asymptotics of the first term (|2.13[) in K 1 ^ 1 '. The asymptotic expansion of 
the two other terms (|2.14j) in K^f and of K^f will be an easy corrollary of the 
arguments used hereafter. 
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We first examine the saddle point analysis for the approximate exponential 
term : 



fH=fuH := w 2 - 2(w - b)^i + a 2 J Hw 2 -y)dp MF {y). (2.21) 

Note that / is the almost sure limit of the exponential term arising in the 
definition of (|2.13|) and should thus (approximately) lead its asymptotic analysis. 
Then using the fact that, when 7 = 1, a Marcenko-Pastur random variable has 
the same law as a squared semi-circle random variable, one gets that 

2w 



f'(w) := 2w - 2V^I + a 2 / — dp MP (y) = 2w-2^I + 4a 2 (w - ^w 2 - 1). 

J w 2 -y 

(2.22) 

Given a point u G [e, 1 — e], e > it is easy to see that / admits two conjugate 
critical points: 



w 



_ (1 + 2a 2 ) y / u± 2a 2 Wl + 4a 2 U , — 2 ■ / — MP 1 



l + 4a 2 



;±a 2 nr^p MP {u) + 0(a 2 ). 



More precisely, one has that lm(wf) = ±a 2 n^/up MP (u) + 0(a 4 ), where the O 
is uniform in the bulk (depending on e only). One can also check that 

f"(wf) = 2(1 + 2a 2 ) + = 2 + 0(a 2 ), 

where the O(-) is uniform as long as u £ [e, 1 — e], e > and depends on e only. 
We now give the relevant contours for a saddle point analysis of the approximate 
exponential terms. Let T = Re(w+) + it,t € K and Li = 1^ U with 
r i~ = {wt(t),t G [§,1-§]}U{w+(1 - §)+x,x > 0}U{iIm(u;+(f))+x,e/180 < 
x < Re(w ( t(|))}. Then it is easy to check that Re/ achieves its maximum (resp. 
minimum) at w^r on T (resp. T). 

We now turn to the saddle point analysis for the true exponential term: 

2 N 

f N (w) := w 2 - 2(w - b)yfc +^Y. ln ( w2 - V*)' ( 2 - 23 ) 

One has that f' N (w) = 2(w — *Ju) — 2wa 2 mM{w 2 ). Using the concentration 
results of Proposition 12.11 we now show that the first derivatives of Jn and / 
are close (on a suitable set). Let C > be a very small number and set 



S, 



, ( = {z = x + iy,xG [ u _ + -i_ )U+ -|] ) (a 2 <y< l}, (2.24) 



with u- = and u + = 1 here as 7 = 1. Define for H = = w ^ , 

tt N = { \m N (z) - m(z)\ < N^r, if ^ < Re{z) < 1 - |, ^a 2 t < Im(z) < 1; 
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mpyi(H) < K; sup \m N (x + iy)\<K\. (2.25) 

l^l>(2rar) 2 ^>C« 2 j 

Using Proposition 12.11 and Lemma 7.3 in [13], we can deduce that there exist K 
large enough and constants c, C > such that 

P(fifr) < Ce- cJvV4 . 

From now on we assume that -Hat € f2jv- As a consequence, there exists C > 
such that for any to G S e ^, 

Cn 2 

\f N ( w )-f( w )\<j^. (2.26) 
By Cauchy's formula, one can then deduce that for any integer / > 1 

\f^H-f^H\< a2il S )Nm - (2-27) 
Here the value of the constant C varies from line to line. 

Let us now show that /jv admits two conjugate critical points w^ N which are 
very close to wf. The proof directly follows the arguments of |15| . Section 3. 
The critical points of /at are the solutions of the fixed point equation: 

o N 

i— a x w 

w = F N (w) = — 



This equation clearly admits 2N — 1 real solutions which are interlaced with 
— ^/y N \/y 1 , y/y 1 , ■ ■ ■ , ^/V N ■ It admits also two non real solutions w c N . We 

now show that w^ N are very close to wf. Set F(w) = y/u — a 2 J J±_ y dp MP (y). 
Define 6 = {z £ C, |Re(z) - 0I| < C a 2 , (a 2 < Im(z) < C a 2 }, for some 
large constant C . Then, on 6, \F N - F | < Ca 2 N~ x / i . As on 9, Re(F(z)) = 
^/Ti+0{a 2 ) and ka(F(z)) = a 2 T7^/Tip MP {u) + o(a 2 ) we deduce that F N (Q) C 9. 
Now it is an easy consequence of (|2.27[) that Fn is a contraction: one has 
that \F' N (w) - F'(w)\ < CN~ x / i . Thus F N admits a unique fixed point with 
Im(w~^ N ) > (a 2 . Furthermore, it is an easy fact that 

i _i_ i | C a 

Kn-K\ < ^71- 

We now slightly modify the contours T and T for the saddle point analysis of 
/n- Set Tjy = j^Re^^y) + it,t € k|. Then Tat lies within a C 1 — distance of 
at most Ca 2 N~ x / 4 from T. Furthermore, setting w = Re(w ( f N ) + it, 



K 1 N % i {\w-^y % \ 2 + \w + ^y i \ 2 )}- 
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By the monotonicity of t i-> J2iLi { \w-l/y.p + \w+]/y.\^ ) > we conclude that 
Re(/Ar) is maximum on Tjy at w^ N . More precisely, using that Re/^(iu) > 1 

,21 



for any w S T n {Im(w) > (a }, one has that, if t stands for Im(w), 

Re(f N (w) - f N (w+ N )) = -Re / sds f^(wf N + ius)du 



< _('-*■■<<»»', (2 , 8) 

This implies in particular that 

Re (f N (Re(w+ N ) + iC,a 2 ) - f N (w+ N )) < - (( - a- 2 Im(w+ N )) * a 4 /4. 

Let then w G T n {^,0 < Im(z) < £a 2 }.Then, with a slight abuse of notation 
when Im(w) = 0, 



exp 



{/ivH - In (Re«w) + *Ca 2 ) } 



If one chooses (, in (|2.24|) small enough so that for any v € [e, 1 — e], (a 2 < 
Im(wj"jy)/8, one can then deduce that the contribution of the contour T n 
{z,Im(z) < (a 2 } is negligible. Thus, using (I2.28|) . the main contribution to the 
w— integral comes from a neighborhood of width of the critical point. 
We now turn to the z— contour. Let 



r + - 

1 N — 



{w+ N (t), e/2 < t < 1 - e/2} U [w+ N (1 - e/2) + x, x > o} 
u{iIm(w+ N (e/2) ) + x, e/180 < x < Re(w+ N (e/2) )}. (2.29) 

We can now define x^ := e/180 ± iJm (w~^ N ^e/2^ . The choice of 180 here is 

arbitrary: we only need a sufficiently large constant. We also set xf — —xf. 
By construction, for any t, | < t < 1 — 4, 

Re(-fN(w+ N (t)) + f N (w+ N )) < -ciV^-Vi) 2 , 

for some constant c > small enough. This follows from the fact that for any 
t, | < t < 1 - § , Im^+^i)) > (a 2 and Re f = 1+ 9 % a2) . Moreover 



R e f/iv(^^(l — §)) ~ /jv( w ^tv(1 ~ f ) + — — ca;2 ' f° r somc constant c > 
small enough. This follows from the fact that on fijv a 2 zrriN{z 2 ) — > uniformly 
along Tjv as mjv is bounded. Thus the contribution of {w* N (l — §)+x, x > 0} is 
negligible as TV — > oo. For the same reason, the contribution of {ilm(w^ N (^ j) + 
x, e/180 < x < Re(wj" Ar (|))} is negligible in the large N limit. As for the 
w— integral, the main contribution to the z— integral comes from a neighborhood 
of width ^fS of the critical point. 
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Remark 2.2. This analysis justifies Claim I2TT1 as Re 

for some c > 0. One can choose L \ L c as {x^ + ia 2 t, < t < A — Im(i|)} U 
{x+ + ia 2 t,0 < t < A- Im^)} U {x + ia 2 A,Re(x+) < x < Re(x^)} plus 
its conjugate, for some large enough constant A. There exists C > such that 
R- e {Jn(xi) — Jn(z)) < Ca 2 , Vz € r\r c , thus yielding a negligible contribution. 



We can now conclude to the asymptotic expansion of the correlation kernel 
and prove Proposition ^. 21 Let be as in Proposition [ 



We first consider the asymptotics for the first term (I2.13|) in K^f. We now fix 
b as follows: 



Re 



( w Xn( u *)) ■ 



Thanks to this choice, \Ke(w^ N ) — b\ = o(N x ~ 1 ). Therefore the function 9 has 
no impact on the saddle point argument exposed in the above, neither do the 
functions <?jv,u and gjf u . At the critical points, one deduces from (|2.12j) that: 

9N,u( W t^ w tff) = ( W c,N ~ b )fN( w tN) ~ * Im ( w tN) iwK^v); 



«n> b) = ? Ar ^ _ -(1 + o(l)) 



2Tm (w^'^J - \A>) 



Let us now consider the 4 combined contributions of the different critical points. 

V c,N 



The contribution to (|2.13[) from z = w = w^ N gives (at the leading order) 



2 „„..,± 



±i / V2nS \ 1 + s e ^ -1 



( UV y/± f KJV) (y/u- 0) 



(2.30) 



The contribution to (j2 . 13[) from z = w = w^ N is 

0(ATi-V2) due to the fact 
that gN,u anneals at that point. Combining the above yields that: 

lim . (f2T3ll = lim ,!„ . . (l2~30l) = !^i!H2. 

JV^oo jVp MP ( u ) 1 » Np M P 1 * 7rr 

The contributions of the two other terms in (|2.14l) is exponentially small 
since there exists a constant c > depending on e such that: 



Re 



This finishes the asymptotic expansion of K^f '. 

Let us now turn to the asymptotics for K^i '. The function 1 is not bounded 



N 



Nevertheless there exists x > depending on e such that 



e — s0 1 (z,b) 



< 
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e xS . This follows from the fact that Re(& — z) < b along the contour IV- 
One can now use the same saddle point arguments as for the study of to 
show that 

w -l^- ) K 2 /(u,v;H)=0(e-^- 1 ). 



This is enough to ensure Proposition 12.21 □ 

Remark 2.3. In the case where \u — u*\ = o(N~ c ) for some c < 1 — A, we choose 
a sequence u(N) such that N^^u - u(N)\ -> 0, |u* - u(N)\ = o(7V~ c ) and 
u(N) € [u- + e, u + — e] for all N . We can then replace — > u(N) in all the 

asymptotic analysis (in particular b = b(N) = Re (w* N (u(N))\). This has no 

impact on the validity of the arguments. 

2.4 The case where v — > oo 

We now turn to the case where v is unbounded (that is either 7 = 1 and v — > oo 
or 7 7^ 1) and study the asymptotics of the correlation kernel (|2.4p . The aim of 
this section is to prove the following Proposition. Let e > be given (small). 

Proposition 2.3. Assume that 7 > 1 and A < 1/2. Let € [it_ + e, u+ — e] 
and w = u(N) be a sequence such that limjv->oo ^V 1_a (m — u*) = 0. T/ien i/iere 
exists b £ K smc/i i/iaf outside a set of negligible probability, 

1 / r . etn sin(Trr) 



li m , T no, , K N (u,u+ — .- D . 



Remark 2.4. Proposition 12.31 extends to the case where \u — u*| = o(A r c ) for 
some c < 1 — A by the same arguments as in Remark 12.31 

The whole subsection is devoted to the proof of Proposition 12.31 The proof 
follows the same steps as in subsection 12.3.21 and we explain the main changes 
only. 

2.4.1 Rewriting the kernel 

Again the basic argument is to replace Bessel functions with their asymptotic 
expansion (large order large argument) and then derive the asymptotics of the 
correlation kernel by a saddle point argument. 

We again cut the contour T in a small neighborhood of the imaginary axis as 
on Figure [21 We also call T c this cut contour and the endpoints , x J will 
be defined in the sequel. Let us denote again Ti the part of r c lying to the 
right of the imaginary axis, so that T c := T\ U (— Ti). Wc now consider the 
uniform asymptotic expansion (|B.2I) of Appendix B. We assume for a while that 
we can replace the modified Bessel functions with their asymptotic expansion in 
the correlation kernel (this will be proved in Lemma 12.41 below) . Thus, setting 



v = u ■ 



we consider the kernel: 



JVpM-P(u) - 

2 +2(^-t,)vu-2(^-t,)yT7 



K N (u,v;H):= J. — ryrj / dwf dz- 
4i 2 ir 2 S(uv) L / 4 J lR+A J Fl 
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N 9 

e^^d/-V.) TT (2.31) 
± - L z 2 — in \ z / w — z 



z — Vj \ z J w — z 
i=i yl 



„2_,2, 



+2(^-b) v A7-2( m -b) N A? 

e s 

dw I dz- 



TV 

z 2 - y, 



N 9 

nw — Vi ( w\ u w — z 
z 2 - in \ z I w + z ' 



We call again K^f (resp. K^f ) the kernel given by (|2.31[) (resp. I2.32[) . Set 

h- l) 2 N x 

V(w,z) = exp{ — (1/w-l/z)}. 

4u* 

We can now slightly modify the arguments of subsection 12 . 3 . 1 1 to deduce that: 

1 f , f fMw)-fNM , . 

dw I dze M>[w,z) 



iR+A 



" + Z - {g NiU (w, z) + g 2 N<u (w, z)) (^) " 6(w, b) (2.33) 



W\/Z 



(x 1 -b) f w + x, ( w \ v fw^-fif^! ) _ 
1 dw p= — e s 9(w,b)^(w,x 1 ) 



4*2^ {uv) i/4 y. R+A 

(4-b) f ^, w + x+ ( w \" J»™-'»<'t> of ^ ,.„! f, 



4i 2 7T 2 (Mv) 1 / 4 /ttt. /~+ 



cZu; p= — r e s 6(w,b)^(w,xJ), 



(2.34) 

where 

2 / \ i , N SW A ( 7 - 1) 2 / 1 + 
9N,u(u, *) = 9 N ,u(w, z) + ^ [~ ~ b-^ j . 

We leave K^ b unchanged since the singularity l/(w + z) will not prevent its 
direct saddle point analysis. 



2.4.2 Saddle point analysis 

We shall now perform the saddle point analysis of the correlation kernels here- 
above. The arguments follow closely those of Subsection 12.31 The approximate 
exponential term to be considered here is given by 

h u {z) := z 2 - 2y/Ti(z -b) + a 2 J ln(z 2 - y)dp^ p {y) + a 2 (7 - 1) lnz. 

We are indeed going to show that the perturbative term coming from \& does 
not play a role in the asymptotics. Let m 7 (z) be the Stieltjes transform of the 
Marcenko-Pastur distribution p^ IP . One can check that 

m , w . - 2 + i_l + 2 vi=-(i + y?)y4)(,- t i-^m 

2z z 
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The function h u admits two non real critical points which are conjugate 



w. 



± = wf{u) = Vu(l - 2a 2 ) ± ia 2 nV^P™ P (u) + 0(a 4 ). (2.35) 



We now define the contours relevant for the saddle point analysis of the ap- 
proximate exponential term. Let T = Re(it;+) + it, t G R be oriented positively 
from bottom to top. The contour Ti is defined as in Section 12.3.21 we set 
Ti = U where Ti is oriented counterclockwise and 

T+ = {w+(t),t e [u- + e/90,u+ - e/2]| U |w+(u+ - e/2) + x,x > o| 
u{am(w+ (u- + ^) ) +x,e/l80 <x< Be(w+ («_ + ^) )}• 

We first consider the z-contour. There exists c(e) > depending on e only such 
that for all points t £ [it- + e/90, u + — e/2] in the bulk of the Marcenko-Pastur 
distribution Re (/i"(w+(t))) > c(e). Using the fact that 

-^Re (^,(«,+ (t))) = (VI- V^)Re * +f+ .. , (2.36) 

— Re(/i u ) achieves its maximum at z — wf along the first part of Tf . There now 
remains to show that — Re(/i u ) also decreases along |u;+(u + — e/2) + x,x > oj 

and jiu+(u_ + ^j)— a;, < x < Re^w+ (u_ + ^j) )~y§g j- This assertion follows 

from the fact that m 7 is bounded along this curve. Using the same arguments 
as in Section 12.3.21 it is easily seen that Re(/i n ) admits its maximum on T at 
wf. All the above reasoning holds unchanged if 7 is replaced by p/N. Thus, 
without loss of generality, we assume in the rest of this section that p/N = 7. 



We now consider the true exponential term. We set 

2 N 

r 
N 



h N , u (z) = h N (z) := z 2 - 2^/u(z - b) + — 2J hi(z 2 - jfc) + a 2 (j - 1) In; 



Again we can show that, provided H belongs to a suitable set, the derivatives 
of h u and Hn,u are very close and that hjy lU admits two non real critical points 
w cN very close to wf. Let £ > be given (small) and S e ^ be given by (|2.24j) . 
Define for H = H N - 



N 



^at, 7 = \ supyi(H) < K; sup |mjv(x + iy)\ < K; 

\mi\r(z) — m(z)\ < N~ , Vz with Qa 2 e < Im(z) < 1 
andu_ + ^<Re(z)< U+ -|}. (2.37) 

Using Proposition 12 . II and Lemma 7.3 in [12], we can deduce that there exist K 
large enough and constants c, C > such that 
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From now on we assume that Hn G ^N,-y- Mutatis mutandis, all the argu- 
ments used to consider the true exponential term when v is bounded (starting 
with (|2.26p ) can be copied to consider the true exponential term when v is not 
bounded. In particular, one has that 



\<n - «£l < ^ KU W ) ~ ^°HI < a2(; _ 2)jvA/4 v/ > l, V«, e S eX . 

We now consider the subsequent modified contours. We set Tjv = UT^ with 
T+ = [w+jf&lu- + ^ < t < u+ - |} U {w+ N (u+ - I) +x,x > 0} 

u{ 4 Im (<*(«- + ^)) +a:,c/180<aj<Ite(«>+^(u_ + ^)) }; 
T w = Re (w+ N ) + it, t G R. (2.38) 



Accordingly, we now choose x^ = ±ilm(w£ N (it_ + ^j) j + j§q and x^ — —xf. 

Then the main contribution to the exponential term of (|2.33[) comes from a 
neighborhood of width \/~S of w^ N for both the w— and z— integral. Thus one 
is left with showing that the function \E' does not impact on the saddle point 
analysis. Let O w — {w' £ C, \w' — w\ < N a \/~S} for some a that we determine 
hereafter. In order to ensure that VP has no impact on the saddle point analysis 
outside O w ± , it is enough that a > A/2. Now one can check that 



^{w,z) - 1 



< CN^/ 2 ^- 1 « 1, 



if a and A are small enough (in particular A < 1/2). This finishes the proof 
that the main contribution to (12.331) comes from a neighborhood of width \~S 
of w^ N for both the w— and z— integral. 

We now set b = Re (w^ N (u*)^J , so that b — w^ N — o(N x ~ 1 ), which implies 

in particular that the function 9 does not impact the saddle point argument. 
Let us now consider the contribution of the functions g^^ u {w,z) + g 2 N u {w,z) 
and 9(w, z) close to the critical points. One has that 

9nA W c,N> W c,n) + 9 2 N (Wc,N> W c,n) = ( W c,N ~ b ) ^O^Ar) + °( S ) 

SN x ( 1 -l) 2 w± N ~2b 

= (<N-b) h%(w± N ) + 0(SN x ); 
9nA w c,n> w Zn) + 9 2 n(w*n> w Zn) = 0(SN x ); 

0{w% Nl b) = j \ — — (1 + o(l)). 

We can copy the end of the proof of Proposition 12.21 to show that on Q, jy i7 
v 1 iAf>< tts sin(Trr) 

JV^oo NpM P (u) N ITT 
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To study the asymptotics of , we use the same saddle point analysis ar- 
gument. Note that this is possible since Tjvn(— Tn) = 0- We consider as leading 
exponential term for both the w- and z-integrals the function /ijv,« as above. 
Note that the w-integrand has then a "perturbative term" e 2 (« J - fe )(V«-v : K)/.S 
which does not play a role in the saddle point analysis. It is easy to deduce that 



lim 



NpM p {u) 



K^ b (u, v; H) 



0. 



This follows from the fact that the contribution from equal critical points is neg- 
ligible (due to the (w — z) factor), while that of conjugate critical points is of or 

' s s- 1 (fcj V , u (^ w («))-ft JV ,,(t«^( v )) ■ 



der 1 /N due to the rescaling and the fact that 
C for some C > independent of TV. 



< 



Thus we are now left with showing the following Lemma 
Lemma 2.4. One has that 

1 



lim 

N-+oc 



Np 



MP! 



K N {u,v-H)-K b N {u,v-H) 



Proof of Lemma 12. 4t Note that it is enough to show that \Kn(u, v; H) — 
K b N (u,v; H)\ = o(N). First Claim |2~T1 can be translated with no modification 
to the case where v is unbounded. Also we can reduce T c to Ti in K b N using the 
change of variables z4-z. There remains to prove the counterpart of Claim 
Set 



2^/uz 



( 7 -l)a 2 ' * (7-l)a 2 
A[(Z) 



W 



2y^ 



( 7 -l)a 2 



A' 2 (W) r- 



A,{Z) - 
A 2 (W) 



(7- l)a 2 



^Jl-nvZ 



In Kn we have replaced the Bessel functions I v and K v with the "approxi- 
mations" A\ and A 2 . It is an easy computation that, along Tjv and Tjv, one 
has: 



I„(yZ) 



A[(Z) 



1 



< 0(S), 



K v {vW) 



A' 2 (W) 



1 



< O(S). 



(2.39) 



We first consider the contribution of T l N :— Tn H {Re(xi) < Re(z) < Re(u>j" JV )} 
to Kn(u,v; H) — K b N {u,v\H) where the new contour T^, is slightly deformed 
around WT N so that d(TN,T l N ) = e'\/~S for some small e' (see Figure [3]). It is 
then easy to deduce from (|2.39l) and the previous saddle point analysis that 



2b( % A7- v ^T) 

e s 

2i 2 7r 2 ,S* 2 



dw 



dz 



N 9 

fr w - y% (wy 

Z 2 — yi V Z J 
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T 

N 



Figure 3: The contour T^r has been deformed around w^ N . 



A' 2 {W)I V Q^) - A' 2 (W)A[(Z)^ 

< o{Vs). 



(2.40) 



Indeed replacing A\ and/or A 2 with A[ and/or A' 2 does not impact on the 
saddle point argument. Similarly 



2b( v /?- N AT) 

e s 



2i 2 ir 2 S 2 



N o 

dw I dz e^^TT — -(- 

z s — yi \ z I w — z 



< 0(VS), (2.41) 



by using (|2.39l) . the fact that I v {vZ)/A' 1 {Z) is uniformly bounded for N large 
enough and the previous saddle point analysis. Lastly 



26(,A7-v'S) 



2i 2 7T 2 5 2 



dw dz (A 2 (W)A 1 (Z) - A' 2 {W)A^{Z)) 



N 2 

"i r W — Di fW\ v W-\-Z 

1 J. Z 2 _ y \ Z J 

i=l yl 



Z Vi V2/ w — z 



( N^/S\u-u*\ | N*\u-u,\ | jy\/2 
\ e'VS N a y/S J 

(2.42) 

< O ( A^«+2A-i + N ^- a + N $\ _ ( N y (2.43) 
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In (|2.42[) we have separated the cases where d(w, z) > N a */S or not, where z £ 
T l N and w £ and used the fact that \u — v\ ~ iV _1 and \u — uJ = o(N x ^ 1 ). 
Replacing S±| with ^| in (|2T4U| . (l2Hj) and (|2T4l?l) yields similar estimates. 

We now turn to the contribution of T r N :~ n {Re(z) > Re(wj" w } to 
Kn(u, v; H) — K h N {u, v; H). Note that Tjv can be moved to the left of Re{w~^ N ) 
up to adding the residue at w = z. Define Yjf as the contour obtained by 
reflecting with respect to the line Re(z) = Re(w^ N ). Then 



dw dz e s 



N , 

+™ 2 i r w — yi rw\ v w + z 

1 1 Z 2 _ y \ Z ) 



A+iK, JT- N t=\ Vi W ~ Z 



(k^)i^)-a[ { z)a> { w)) 

*■ z 2 — in \ z J 



N 

, -z 2 +m 2 -f-f w z — yi ( w\ v w + z 

dw dz e s 1 



(^(xWt)- a ^ 

(2^44) 

'2z^/v\ (2z\/u\ Ai,r7\Ai( tzy/v 



+ 2iJ r dz2z{K v {^f) I„(^f)-A[(Z)A>( 



a*( 7 -l) 



(2.45) 



The analysis of (|2.44l) is similar to that of T l N flT^. We thus have to consider 
(|2T4"5j) . One has that 



= O (sj r dz2zA[(Z)A' 2 ( J^ ) j . (2.46) 



As the integrand in the r.h.s of (|2.46l) has no singularity, one can thus move T r N 
to the line joining the two critical points w c N . Using now that \b-Re(w+ N ) \ = 
o(N x ~ 1 ), one can easily see that there exists a constant C > such that 



e — s — (Mi s- 2 < c. 

One gets similar estimates replacing ^| with j^rf- This yields Lemma [2~4l □ 



3 Proof of Theorem 11.11 and Theorem 11.21 

We only give the proof of Theorem 11.11 when 7 = 1 and v is bounded. The 
extension to other parameters 7 is straightforward. The proof of universality 
for the spacing distribution can easily be deduced from [TB] and [T5] and is based 
on the extension of Proposition 12.21 given in Remark 1 2. II Let then / € L°°(R 2 ) 

( 2^ 

with compact support and S N (f,u, pn) be denned by (|1.2[> with u G [e, 1 — e] 
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and pn — N p MP (u). Then if E denotes the expectation w.r.t. the distribution 
of Mat in (|2~Tj) . 

ES^\f,u,p N ) = f dP N (H) f fg(x 1 ,...,x rf )S%\f,u,fa)[x]dx' T , 



where (resp. Sjp(f,u,p?t)[-]) is the density function (|2.3[) (resp. local 

eigenvalue statistic) of the deformed Wishart ensemble. Then, using (|2.25p and 
setting 

dP N (H) = ^dP N (H)l He n N) 
where Zn is the normalizing constant, we get that 3C > such that 



dP N (H) I fg(x)S%\f,u,pZ)[x]dx 



<CiY 2 |/Ue- cJvA/4 = (l). 



(3.1) 



Now, 



lim 

N— too 



lim 



N- 



dP N (H) 
dP N (H) 



f^(x)S < ^\f,u,p N )[x]dx 



N 



f(h,t 2 ) (2) 

~2 U N 

R 2 PJV 

sin7r(ii — t% 
Wi - t 2 ) 




Pn Pn 



dt\dt 2 



where we used the definition of correlation functions, Proposition 
fact that / has compact support. 



(3.2) 
and the 



There now remains to extend the result to non Gauss divisible ensembles in 
order to prove the full Theorem ll.il The argument exactly follows the arguments 
of [TS]. We recall them for seek of completeness. (|3.ip and (|3.2[) show that the 
sine kernel holds for the complex measure e tc G t dn® Np ® e tc G t dp® Np if t — 
N~ 1+x . More precisely, let pM : t{x) denote the density function of the eigenvalues 

(2) 

x = [x\, . . . , a; v) w.r.t. this measure and let R N t be its two point correlation 

(21 (21 

function. Similarly, we define pn, c {x) (resp. pn(x)) and R Nc (resp. R N F ) 
for the eigenvalue density and two point correlation function w.r.t. truncated 
complex measure F Ci n, p = fc,N, P dp® Np ® f c ,N, P d(J.® Np (resp. w.r.t. measure 
F N p = F® Np ). We also set in the following 



u(ti) = u H — and ufa) — u H — with — N pYi '( u )- 

Pn ~ Pn 



Then 



^!f(«(*i)>«(*2)J -fl^ t («(ti),«(*a)J f(h,t 2 )dt 1 dt 



?(2) 
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where 



CO == 



^(uCtO.uCia)) -#w^<*i)X*2))]/(*i,*a)<Ma 
(17) := J \R%l(u(h),u(t 2 )) - i2^(«(*i),«(t 2 )) | l/(*i,*a)|d*ie«a 



It is easy to see from Proposition II. II that 

GO <^ 2 ||/||oo2Ce- cJVC <C'e 



c'N° 



with some C", c' > as N — > oo. To estimate (II), we use the fact that 



(I/) 2 < 



I? 



(2) 
N,t 



R 



R 



) t («(ti),«(«a))|/(*i,*a)|*i*a 

(3.3) 



I? 



J t («(*i),«(*a))|/(ti,*a)|dti<fta 



(3.4) 



The factor (|3.4p is bounded using (I3.1[) and (|3.2[) . We now use the beautiful 
idea of [TS] to bound (|3.3I) . Set g := p^-f(u). As / is bounded, one has that 



< C 



< c 



< c 



N 2 g 2 
N 2 g 2 
N 2 g 2 



R( N,t( z >y) 

( PN,c( x ) 



RN,t( z iy) dzd y 

1/2 



1/2 



\PN,t(x) 

|e* G t — f c ,N,p\ 2 



~ 1 ) PN,t( x ) dx 

-11/2 



- tc G t 



< CN- 1+iX 7 



(3.5) 



where C > is a constant that varies from line to line. Here the basic argument 
is that the distance D(f, g) :— J \ f/g — l\ 2 g between two probability measures / 
and g decreases when taking marginals as well as when passing from the matrix 
ensemble to the induced joint eigenvalue density. Finally, we used the estimate 
of Proposition 11.11 This completes the proof of Theorem 11.11 □ 



A Appendix : Proof of Proposition 12.11 



To ease the reading, we here recall Proposition 12. II 

Proposition A.l. Let z = u + irj for some u € [u— + e, u + — e] and r/ > 0. 

Then, there exist a constant c\ and C D , C > 0, c > depending on e only such 
that V<5 < c%e, 



sup 

, u£\u—-\-e,u+ — e] 



m N (z)-m M p(z) >(5 + C |^- 7 |) <Ce~ cS ^, 
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for any (In N) A /N < r\ < 1. Furthermore, given rj > (In N^/N, there exist 
constants c > 0, C > and -fT smc/i £/ia£ Vk > if OJ 

P( sup \m N (x + iy)\ > k) < Ce' c ^^. 

\|x|>(e/200) 2 ,y>r, j 

Proof: The proof follows closely that of Theorems 3.1, 4.1 and 4.6 in [T4"] . 
We only prove here the first statement. The second one is simple adaptation 
of the latter statement and ideas given in the proof of Theorem 4.6 in [14] (see 
also the proof of Theorem 2.1 in [T5]). 

Using a discretization scheme with step 8/Arf (see [2]), it is possible to show 
that it is actually enough to prove that 

v(\m N (z)-m M p{z)\ > ($ + C |Jl-7|)) < Ce~ cS ^, 

for any (In N) A /N < r\ < 1 and for a given z = u + i?7 where u G [it_ + e, it + — e] . 
Denote by Ck the feth column of W/\N, Ck ■= ^j^(W)fe) then one has that: 

H N = J2CkC;. (A.l) 

fe=l 

We define also Rat(z) = (H^—zT)^ 1 and for any integer k = 1, . . . ,p, H^\z) := 
(Hn — CkC^ ~ zl)^ 1 . Then mjy(z) = jjTtILn(z) and one has that 

1 p 1 

l + zm N (z) = |- - — V ttt . (A.2) 

Equation (|A.2[) simply follows from (IA.1I) and the identity Hn{z)(Hn — zl) = I. 

(k) (k) (k) 

In addition we denote by y\ > y 2 ' > ■ ■ ■ > y N the ordered eigenvalues of 
HN—CkCl and set /Jjy :— jj J2iLi <^< fc > ■ By the well-known interlacing property 
of eigenvalues, for any k = 1, . . . ,p, y% > y[ k) > y 2 > > • • ■ > Vn > Vn ■ 
Denote by Fn (resp. F^) the p.d.f. of the spectral measure /xjv (resp. 
Then one has that 

NF N (x) - NF^\x) < 1, Va; e E. (A.3) 

Last call , i = 1, . . . , N a set of orthonormal eigenvectors associated to the 

(fe) 

ordered eigenvalues y\ and define 

^ :=\<v < t\VNC k >\\ 
From (|A.2|) . one gets that 

p 

X X 



1 P 1 
l + zmjv(z) = — . (A.4) 



1 I 1 V^ iV S 
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The latter formula (|A.4[) is the counterpart of formula (2.6) in [T3] ■ The proof 
of [M] can be summarized into 2 basic ideas. 

First the random vectors Ck are centered with i.i.d. entries and they are in- 
dependent of R]^ . It is known that the random variables Cl~R,^\z)Ck con- 
centrate around their means which is given by E|Wii pTrR^/TV. The speed 
of concentration is explicited under assumption (Ai) in Lemma 4.2 in |14) for 
Wigner matrices. The extension to Wishart matrices states as follows. 



Lemma A.l. Set X^ = X = j-j Y^k=i ~ m ~ where z = u + ir),u G [it_ + 



N 

e,u+ — e]. Then there exists a positive constant c (depending on e) so that for 
every S > we have 



F[\X\ > S] < 5 e ~ cmin { s V N, i/~*> s2N v/~t} 

if Nrj > (In TV) 2 and N is sufficiently large (independently of 5). 

The proof of Lemma IA.ll is postponed to the end of Appendix A. 

Remark A.l. Lemma |A. II is established under the assumption that the real and 
imaginary parts of the components of Ck are i.i.d. (in addition to the gaussian 
decay assumption (Ai)). This is the reason for our assumption (H3). 

One can then use (|A.3[) to show that for any k = 1, . . . ,p, |TrR^ — TrRjv| < 
rf 1 . By Lemma |A~T1 |A"W| < 6, Vfc = 1, . . . ,p in (|A.4j) with high probability. A 
bootstrap argument exposed in [12j . Section 2, yields that with high probability, 
the Stieltjes transform mAr(z) satisfies 

1 + zrn N (z) = — - — — + A, (A. 5) 

where |A| < C'5 for some C > is a small error term and a 2 = ElVt^nl 2 = 1/4. 
The second basic idea is the stability of the equation (|A.5[) . The equation 

1 + Zmfz) = £ - £ J— rr (A.6) 

admits a unique solution satisfying lva{m(z)) > whenever Im(z) > 0. This 
solution is m M p p / N (z), that is the Stieltjes transform of the Marcenko-Pastur 
distribution with parameter p/N. Now, the stability of equation ()A.5|) implies 
that there exists a constant C such that for any z G {z — u + ir], u G [u~ + 
e, u + ~e], (In N) 4 /N < v < 1}, 

\m N (z) - m MP , p / N (z)\ <CA. 

The constant C here depends on e only. Now, as \z — u± \ > e and l/\z\ = O(l), 
there exists C Q > depending on e only such that 

\m M p !p/N {z) - m MP (z)\ < C\p/N - 7 |. 

This finishes the proof of Proposition 12.11 provided we show Lemma IA.11 □ 
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Proof of Lemma IA.lt Define for n > 1 the intervals /„ = [u — 2 n ~ 1 r],u + 
2 n ~ 1 r]} and let M and Kq be sufficiently large numbers. We have [— Kq,Ko] C 
I no with n = Ci\n(K /rj) < C2^n(NK ) for some constants C\,Ci > 0. De- 
note by A the event: 

where for a given interval I, we denote by |7| its length and set Af(I) = §{i = 
1, . . . , N\yi E I}. Let then denote the probability w.r.t. Cfc. One has that 

P[\XW\ >6]< E(l A c¥ k [\X\ > 6}) + P(A). 

The first term can be handled as in [13] . It is proved in [14], Proposition 4.5, 
under assumption (Ai), and if r/ > 1/N, and for sufficiently large M and K , 
that there is a positive constant c = c(M, Kq) such that for any S > 

E (l A cP k (\X {k) \ > 6)) < 4 e -omin{5Vmj/y/^/N,5 2 N 2 r,/p}_ 

The above estimate requires again the full assumption (H3). 
We now turn to the estimate of P(A). This is the only part which has to be 
modified to consider sample covariance matrices. For this result, we need that 
(lniV) 2 /iV < 77 < 1. Then for sufficiently large M and Kq there are positive 
constants c, C such that for all N > 2, 

P(A) < c e - c V M "^. (A.7) 

To prove (|A.7|) . we first recall Lemma 7.3 of [13] ■ Let Y be a. N x p,p > N 
random matrix with i.i.d. centered entries with variance 1. If the entries of Y 
also satisfy assumption (Ai), there exists a positive constant c such that for 
C > large enough 

V(y max (YY*/p)>C)<e~ cCp . 
Thus we only have to consider 

P ( ( max ^ , > Ml) . 

\\n<n ^Np\I n \ ~ J J 

To this aim we consider an interval I of length 1 1\ = a for some a > rj and call 
u its midpoint. In the following we set z = u + ia and assume that \z\ > e for 
some e > small. Let < $ < 1/8 be given. For each k = 1, 2, . . . , N , we 
define the events 

B {h) := £ < UW) 1), 

and set B = UkB^. As the eigenvalues of Hn and are interlaced, at least 
J\f(I) — 1 eigenvalues of H^f lie in /. It is proved in Lemma 4.7 of [2j that 

3c > 0, P fe (5 (fc) ) < eW^W- 1 . 
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From this, we deduce that there exists C > such that if M is large enough, 



P (B n {N~(I) > Ma-y/Np}) < Ce~ c ^ MaVN¥ . 
Then on B° we have for some constant C2 > 
Af(I) < C 2 Nalm (m N (u + ia)) 



p/N - 1 1 y, 1 



= C 2 Nalm IU"- L -±Y 

\ z N f-f * 4- *rr*. R (fc) 



< C 2 Nalm ( -i V 1 ... | (A.8) 

< 4C ^^ 2 (A9) 

In (|A.8|) we used that Im(l/z) < and p/N > 1. To derive (|A.9[) . we used the 
fact that 

( 1 \ 1 1 
Im 77^ < < 



,(*), 



z + zC* K R^C k J \z\l + C* K R%>C k \z\lm(C* K B%>C k ) 



,(fc), 



with A < e 1 and we estimated from below 
m - 



N 



rn=l \ym Z\ 



This now implies that Af(I) < ol\/ kC^N p j ■& e on B c . This yields the desired 
result : one simply chooses M large enough. □ 

B Appendix : Asymptotics of Bessel functions 

We use two types of asymptotics for Bessel functions. 

The first one deals with is well-known asymptotics of Bessel functions with 
bounded order and large argument (see [22] e.g). They are used in Section [2~3l 
Assume that v is bounded. Then, for large \z\,z G C 

I v {z) = -i=(e* + e-.+C+Va)^! + (l/z)), - J < Arg z < ^; 

K v {z) = ^ e ~ Z (! + OQ./Z)), |Arg z| < ^. (B.l) 
V2z ^ 

We also make use in Section l2~H of asymptotics of Bessel functions with large 
order and large argument. Abramowitz and Stegun [T] (p 378) give the following 
uniform asymptotic expansion of modified Bessel functions of large order : 



OO 



/2tw(1 4-z 2 ) 1 / 4 



fe=i 
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t = 



K v (vz) 



1 



VMi + z 2 ) 1 ^ 



, ip = \/l + z 2 + In 




DO 



i + \/TTI2 



(-i) fe 



2 



M fc (i) 




(B.2) 



where Uk{t) = t k Vk(t) for some polynomial Vk- 
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